Marathi Document: Similarity Measurement using Semantics-based Dimension Reduction Technique
نویسندگان
چکیده
منابع مشابه
Text Document Clustering Using Dimension Reduction Technique
Text document clustering is used to group a set of documents based on the information it contains and to provide retrieval results when a user browses the internet. Experimental evidences have shown that Information Retrieval applications can benefit from document clustering and it has been used as a tool to improve the performance of retrieval of information. Information retrieval is an interd...
متن کاملComparing Dimension Reduction Techniques for Document Clustering
In this research, a systematic study is conducted of four dimension reduction techniques for the text clustering problem, using five benchmark data sets. Of the four methods -Independent Component Analysis (ICA), Latent Semantic Indexing (LSI), Document Frequency (DF) and Random Projection (RP) -ICA and LSI are clearly superior when the k-means clustering algorithm is applied, irrespective of t...
متن کاملImproving Document Similarity Measurement for Mobile Environment with Document Extension
This paper presents a new method for searching for documents which have similar topics to a given set of documents. It is designed to help mobile device users to search for documents in a peer-to-peer environment which have similar topic to the ones on the users own device. The algorithms are designed for slower processors, smaller memory and small data traffic between the devices. These featur...
متن کاملA Novel Dimension Reduction Technique based on Correlation Coefficient
In this paper, a novel simple dimension reduction technique for classification is proposed based on correlation coefficient. Existing dimension reduction techniques like LDA is known for capturing the most discriminant features of the data in the projected space while PCA is known for preservin g the most descriptive ones after projection. Our novel technique integrates correlation coefficient ...
متن کاملDocument Retrieval using Predication Similarity
Document retrieval has been an important research problem over many years in the information retrieval community. State-of-the-art techniques utilize various methods in matching documents to a given document including keywords, phrases, and annotations. In this paper, we propose a new approach for document retrieval that utilizes predications (subject-predicate-object triples) extracted from th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Advanced Computer Science and Applications
سال: 2020
ISSN: 2156-5570,2158-107X
DOI: 10.14569/ijacsa.2020.0110419